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ABSTRACT 

Far infrared cooling lines are ubiquitous features in the spectra of star forming galaxies. Surveys of 
redshifted fine-structure lines provide a promising new tool to study structure formation and galactic 
evolution at redshifts including the epoch of reionization as well as the peak of star formation. Unlike 
neutral hydrogen surveys, where the 21 cm line is the only bright line, surveys of red-shifted fine- 
structure lines suffer from confusion generated by line broadening, spectral overlap of different lines, 
and the crowding of sources with redshift. We use simulations to investigate the resulting spectral 
confusion and derive observing parameters to minimize these effects in pencil-beam surveys of red¬ 
shifted far-IR line emission. We generate simulated spectra of the 17 brightest far-IR lines in galaxies, 
covering the 150 to 1300 /im wavelength region corresponding to redshifts 0 < z < 7, and develop a 
simple iterative algorithm that successfully identifies the 158/rm [Cn] line and other lines. Although the 
[Cn] line is a principal coolant for the interstellar medium, the assumption that the brightest observed 
lines in a given line of sight are always [Cn] lines is a poor approximation to the simulated spectra once 
other lines are included. Blind line identification requires detection of fainter companion lines from 
the same host galaxies, driving survey sensitivity requirements. The observations require moderate 
spectral resolution 700 < R < 4000 with angular resolution between 20" and 10', sufficiently narrow 
to minimize confusion yet sufficiently large to include a statistically meaningful number of sources. 
Subject headings: cosmology: observations, methods: data analysis, galaxies: high redshift, line: 
identification 


1. INTRODUCTION 

Line emission or absorption is a promising probe of 
the high-redshift universe. Considerable theoretical and 
instrumental effort has been devoted to the use of the 
redshifted_21 cm li ne o f neu tral hydrogen in such a fash¬ 
ion (see lPritchard and Loeb I (1 2012 1 for a recent review). 
Comparable observations using atomic or molecular lines 
at far-infrared wavelengths have seen less development. 
Recent advances in far-IR instrumentation, combining 
sensitive receivers with large collecting area, have led to 
the detection of far-IR lines in individual sources at cos¬ 
mological distances, raising the prospects for a future 
generation of far-IR line surveys as a probe of the high- 
redshift universe. 

Infrared fine structure lines are important coolants 
of the neutral and ionized gas phases of normal and 
starburst galaxies. They are easily excited, arise from 
the most abundant metals with low ionization potential, 
and are generally not affected by galactic or intergalactic 
attenuation. The strongest line is the 2 P 3 / 2 —> 2 P\/i 
fine-structure line of singly-ionized carbon at 157.74 /im 
rest wavelength. The ubiquity of carbon in the interstel¬ 
lar medium combined with the relatively low ionization 
potential and modest excitation temperature make the 
[Cn] line bright, with as much as 0.1 - 1% of the total 
bolometric l uminosity of the host galaxy emitted i n this 
single line (iCrawford et akl 119851: iStacev et al. I 119911 : 
iWright et al. I 119911 : iLord et al. I 1 1 9961) . The intrinsic 
line brightness and lack of significant attenuation from 
the galactic or intergalactic medium in the far infrared 
in turn render the [Cii] line visible over cosmological 
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The ability to detect [Cii] emission to redshifts z > 7 
makes the [Cii] line a promising candidate for cosmo¬ 
logical surveys. [Cii] emission originates from multiple 
phases of the interstellar medium. The combined emis¬ 
sion from all phases serves as a marker for source redshift 
for surveys of lar ge-scale structure (|Gong et al. 1 120111 : 
lUzgil et al. II20141) . The empirical relation between [Cii] 
and the bolometric far-IR luminosity allows the observed 
FClll intensity to serve as a t racer of star-formation activ¬ 
ity (|De Looze et al. I l201ll 12014 iHerrera-Camus et al. I 
l2014lf . while the ratio of [Cii] to other far-IR lines probes 
physical conditions within di fferent phases of interstellar 
medium of the host galax i es (I Abel 112006 : iCormier et al. I 
l2012t iCroxall et al. Il2012t iKapala et al. 120141) . 

Several authors have discussed requi rements f or [Cnl 
surveys at cosmological distances. Ida Cunha et al. I 
( 2013a ) estimate that a 500-hour observation of the 
2.4' diameter Hubble Deep Field with the Atacama Large 
Millimeter Array could yield 15 [Cii] detections over the 
redshift range 1 < 2 < 6. The sub-arcsecond angular 
resolution available for such interferometric observations 
minimizes confusion from source superposition within 
each synthesized beam, but the large number of inde¬ 
pendent pointings needed to map an area comparable to 
the HDF minimizes the integration time spent on any 
individual source so that only the brightest sources are 
detected. 

Observations at coarser angular resolution reduce the 
need for multiple pointings, at the cost of introducing 
competing lines from foreground sources. Intensity map- 
























































2 


Kogut et al. 


ping is one such technique, using fluctuations in the spa¬ 
tial and spectral distribution of line emission from a su¬ 
perposition of sources to derive the underlying power 
spectrum of the i ndividual, un r esolve d sources (see, e.q., 
the di s cussion in | Righi et al. I (|2008H : iVisbal and Loeb I 
(l2010fl ; iGong et al. (120111) ) Several such surv eys have 
been proposed. For example, lUzgil et al. I (12014H present 
a concept for [Cii] intensity mapping using a 25-beam 
grating spectrome ter each with beam width 10", while 
iSilva et al~l ; 201 l i propose a suite of single-beam spec¬ 
trometers each with 30" beam width. 

The larger beam size anticipated for such surveys 
facilitates mapping large areas but requires additional 
analysis to separate the target line emission from the 
foreground created by emission from other galaxies 
along the same line of sight. Several authors dis¬ 
cuss cross-correlation techniques to minimize f oreg round 
confusion in line inten sity mapping (iRjghi et al. 1 120081 
IVisbal and Loeb Il2010f) . For a given target redshift, the 
technique defines two observing wavelengths correspond¬ 
ing to emission from two different rest-frame lines. Maps 
of line emission taken at each observing wavelength con¬ 
tain spatially correlated structure corresponding to the 
two lines emitted by each source galaxy. The contribu¬ 
tion from other lines at different rest-frame wavelengths 
redshifted into the observing wavelengths must originate 
from sources at different redshifts. Spatial correlations 
from such distant sources will be weaker, so that the 
cross-correlation traces the power spectrum of the source 
galaxies at the target redshift. 

Intensity mapping surveys large volumes of the uni¬ 
verse to derive statistical estimates of structure for¬ 
mation, but provides little information on individual 
sources. In this paper, we investigate the complemen¬ 
tary technique of direct line identification in narrow- 
beam spectroscopic surveys. If the [Cii] line were the 
only observable far-IR line, redshift identification would 
be straightforward. In practice, however, the [Cii] line 
is merely one of several cooling lines. The problem of 
sorting through an observed spectrum containing mul¬ 
tiple lines from multiple galaxies at different redshifts 
presents both instrumental and analytical challenges. We 
use a toy model of far-IR line emission to evaluate a sim¬ 
ple algorithm for line identification and estimate relevant 
instrumental parameters (angular resolution, sensitivity, 
and spectroscopic resolution) required to derive cosmo- 
logically interesting results. The results are applicable 
for spectral cleaning of intensity mapping surveys as well 
as pencil-beam surveys of more limited cosmological vol¬ 
ume. 


2. METHODOLOGY 

We use a simple model of the brightest far-IR lines to 
simulate the observed line spectra from distant galaxies. 
We assume that the total luminosity, £{vq) emitted in a 
single line at rest frequency Po may be simply related to 
the total infrared luminosity, of a galaxy, 

= f(vo,L IR )L IR . (1) 

The observed specific flux of a single line from a set of 
IR galaxies at redshift z is then given by 
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Fig. 1. — Rest-frame wavelength and mean relative lumi¬ 
nosity for the far-I R li nes u sed in this study (see Table 1 in 
IVisbal. Trac. and Loeb~I (1201 li t f. 
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where +(L IR ,z) is the galaxy IR luminosity function, 
cLl(z ) is the luminosity distance, v' = u(l +z) is the red- 
shifted frequency, and (jiviy) is the intrinsic line profile, 
normalized so that f <f) v {y)dv = 1. Ignoring the effects 
of gravitational lensing, the total line intensity emitted 
from galaxies within a redshift interval zq to the present 
(z = 0) then becomes 
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If the ratio of line intensity to IR luminosity is a constant, 
Eq. f3j simplifies to 
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( 4 ) 

( 5 ) 


n{z) is the comoving number density of galaxies, and 


f L $(L) dL 
* = J*(L)dL 


( 6 ) 


is their average IR luminosity. 

We simulate the far-IR line spectra using a standard 
cosmology with h = 0.7, = 0.28, and Ga = 0.72 

(|Hinshaw et al. I [2 01 31) . We divide the comoving volume 
into redshift bins and compute the total number of galax¬ 
ies N(z) within each bin 


dN(z) dV c (z ) 

— = n{z) ^r 


m 


where V c (z) is the comoving volume at redshift z. We use 
the resulting number density to generate Monte Carlo re- 
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alizations of the far-IR line spectra. The background cos¬ 
mology fixes the mean number of galaxies within a fixed 
observing solid angle. We assign each simula ted galaxy 
a reds hift and IR luminosity, using the iSaunders et, al. I 
( 1990 1 luminosity function to obtain a representative lu¬ 
minosity distribution within each redshift bin. 

Each galaxy then contributes a set of far-IR lines. Fig¬ 
ure |T| shows the relative intensity of the rest-frame lines 
used for this study. We include the [Cll] line at 158 /im, 
[Cl] at 610 and 371 /mi, [Om] at 88 and 52 pn i, [Oi] at 
145 and 63 fin i, [Nlll] at 57 fim, and [Nil] at 205 and 122 
pm, as well as the CO (J —>• J— 1) series through J = 7. 
The total luminosity in each line is related to that of 
the host galaxy by the factor /(Vo) (Eq. [I]). We set the 
mean value of / f or eac h line using the rat ios of values in 
Table 1 of iVisbal. Trac. and Loebl (120Ilf) . For simplic¬ 
ity, we do not explicitly calculate the star formation rate 
for each simulated galaxy, but relate the comoving star 
formation rate p(z) to the IR luminosity density L{z) as 

C{z) « Ap{z) (8) 

where C{z) is the comoving IR luminosity density, p{z) 
is the comoving cosmic star formation rate (CSFR) in 
M 0 yr _1 Mpc -3 , and A = 6.7 x 10 9 is a conversion fac¬ 
tor for a Salpeter stellar initial mass function (Kennicutt 
2012 ). 

The relative luminosities from different lines within a 
source are known to vary by roughly a factor of 10. We 
account for such variation by multiplying f(u o) for each 
line in each host galaxy by a random number drawn uni¬ 
formly from the range [0.3 - 3]. We additionally account 
for the systematic variation in line luminosity and line 
ratios du e to the increased CMB temperature at higher 
redshift (Ida Cunha et al. Il2013blh We assume that the 
host galaxies are not spatially resolved and assign a ran¬ 
dom rotational linewidth to each galaxy uniformly drawn 
from the range [30 300] km s -1 . Each galaxy con¬ 

tributes multiple lines; all lines from a single galaxy have 
the same rotational line broadening. Finally, we accu¬ 
mulate the lines from all source galaxies within the sam¬ 
pled volume and redshift each line to obtain a realization 
the observed far-IR spectrum. The effects of instrument 
spectroscopic resolution are discussed in §3.2. We ignore 
instrument noise and continuum emission from the su¬ 
perposed host galaxies to focus here on the problem of 
line identification. 

Figure [2] shows a typical spectrum with 2' angular res¬ 
olution at wavelengths corresponding to redshifts 4.4 < 
z < 4.5 for the [Cn] line. Two problems are evident. 
The first problem is confusion. Unlike radio observa¬ 
tions, where the redshifted 21 cm line is the only bright 
line, a number of lines contribute to the observed far- 
IR spectra. Only 16% of the lines within the observed 
wavelength range correspond to the targeted [Cll] line, 
with the remainder consisting of “interloper” lines orig¬ 
inating at both higher and lower redshifts from species 
other than [Cll]. The second problem is blending: the 
superposition of rotationally broadened lines produces a 
complicated spectrum. The fainter lines merge to form 
a near continuum spectrum, further hindering line iden¬ 
tification. 

3. LINE IDENTIFICATION 


Although the [Cll] line is bright, simply assuming that 
the brightest observed lines within a single field are al¬ 
ways [Cn] lines produces a high error rate for line identifi¬ 
cation. We quantify this by taking the simulated spectra 
and selecting the brightest 10% of lines within a nar¬ 
row range centered at different source redshifts. We may 
then count how many of the brightest lines in each range 
are in fact [Cll] lines as opposed to interloper lines from 
sources at other redshifts. The probability that a given 
line within the brightest 10% is in fact a [Cll] line falls 
from 0.8 for observations near 315 fim ( z s = 1) to 0.4 for 
observations at 1100 fim ( z s = 6). 

We address the problem of blind line identification 
using a version of the radio-astronomical CLEAN algo¬ 
rithm. Although the brightest lines are not necessarily 
[Cll] lines, the [Cll] lines are generally bright. This sug¬ 
gests an iterative process. We first select the brightest 
line within the observed spectrum. Assuming that this 
“parent” line is [Cll] fixes the corresponding source red¬ 
shift z s . We then predict the wavelengths at which other 
far-IR lines originating from the same source should be 
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Fig. 2.— Simulated spectrum with red-shifted lines. The wave¬ 
length range corresponds to redshift 4.4 < z < 4.5 for the [Cll] 
line. (Top) Individual line flux, shown at the central wavelength 
before rotational broadening. Lines are color-coded by species us¬ 
ing the same color coding as Figure Q] 48 lines from 48 individual 
galaxies at 0.6 < z < 8 fall within the plotted wavelength range. 
The assumption that the brightest lines are always [Cll] lines pro¬ 
duces a high error rate, with roughly a third of the brightest lines 
originating from species other than [Cll]. (Bottom) The same lines 
are shown after rotational broadening. The superposition of rota¬ 
tionally broadened lines produces a complicated spectrum (black 
curve). 
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Fig. 3.— Results of the CLEAN algorithm compared to simulation 
inputs for a simulated spectrum from 580 fim to 1160 pm contain¬ 
ing 390 peaks (see text). (Top) Cumulative number of correct (solid 
line) and incorrect (dashed) [Cn] line identifications as a function of 
the relative line intensity, sorted from brightest (R = 0) to faintest 
(R = 300). (Bottom) Probability that the i? th -brightest peak in 
the observed spectra will be correctly identified. The number of 
false identifications is small for the brightest half of the observed 
lines. 

observed, 

^pred = ^rest (Id - ^s)i (9) 

and examine the spectra at those wavelengths. If the 
expected “companion” lines are found, the redshift iden¬ 
tification is confirmed and both the candidate [Cil] line 
and the detected companion lines are flagged as known 
and removed from the list of lines to search. If no com¬ 
panion lines are found, we repeat the test assuming that 
the parent line is the next-most-conrmon far-IR line (typ¬ 
ically a CO line) and continue down the list of far-IR lines 
until we identify a parent/companion combination or ex¬ 
haust the list of potential parent lines. If no companion 
lines are found, the bright parent line is flagged as un¬ 
known and removed from the list of possible parent lines, 
although it may later be identified as a companion line. 
We then proceed to the brightest remaining un-flagged 
line and iterate until all lines are tested. 

3 . 1 . Efficiency and Error Rate 

We test the algorithm using Monte Carlo realizations of 
the observed spectra and compare the derived identifica¬ 
tions to the known Monte Carlo inputs to determine the 
detection efficiency and error rate. Figure [3] shows the re¬ 
sults obtained from a single 2' beam observing one octave 


in wavelength from 580 /jm to 1160 /Ltm, corresponding 
to [Cn] redshifts 2.7 < z s < 6.3. The input spectrum 
contains 1600 spectral lines from 550 distinct galaxies, 
including 260 [Cn] lines. After rotational broadening, 
the observed superposed spectrum contains 390 peakfl 
110 of which result from a [Cil] line. Blending is signifi¬ 
cant: of the 390 peaks in the superposed spectra, 290 are 
dominated by a single line and 100 are blends of multiple 
lines. The un-blended peaks comprise 175 unique host 
galaxies. 

The line identification algorithm starts with the bright¬ 
est such peak and works to progressively fainter peaks. 
The algorithm correctly identifies 200 lines correspond¬ 
ing to 80 unique parent redshifts and 120 companion 
lines. 40 of the correctly identified parent lines result 
from [Cil] emission, with the remainder primarily from 
lower-redshift CO lines. The detection efficiency is thus 
15% of the 260 [Cil] lines input to the simulation, or 
36% of the 110 potentially detectable [Cil] lines in the 
superposed spectra. 

The algorithm correctly identifies half of the 390 peaks 
in the superposed spectra. The remaining 190 peaks are 
either incorrectly identified (160 peaks) or not identi¬ 
fied (30 peaks). False identifications typically result from 
blended lines and are spread uniformly across all 17 can¬ 
didate lines (i.e. with 5-10 false identifications for each 
line in Figure |TJ . The error rate is a function of rela¬ 
tive line intensity, with the probability of a correct iden¬ 
tification near unity for the brightest 10-15 lines and 
remaining above 75% for lines in the brightest half of 
the observed spectrum (30-40 lines for the parameters 
in Figure [3]). Lines in the faintest third of the observable 
spectrum are more likely to generate a false identification 
than a true identification. Restricting the algorithm to 
searching only the brightest half of the observed set of 
lines reduces the number of correctly-identified [Cil] lines 
from 40 to 38 while yielding only 3 false identifications, 
for an error rate of 7%. 

Incorrect line identification results from the chance 
spectral alignment of one line from one galaxy with a 
different line from a different galaxy. The spectral den¬ 
sity of lines increases at fainter intensities (Fig [2]), in¬ 
creasing the probability for such alignment. Restricting 
potential “parent” lines to the brightest half of observed 
lines largely eliminates this problem without sacrificing 
correct identifications. Incorrectly identified lines may 
also be flagged through anomalous line ratios, although 
in this work we do not apply such a line ratio test. 

3 . 2 . Spectral Resolution 

The CLEAN algorithm identifies [Cil] lines by searching 
for predicted “companion” lines among the set of ob¬ 
served spectral lines. As the number of lines increases, 
confusion from overlapping or blended lines can create er¬ 
rors in the position of the fitted line centers to limit the 
efficiency of the algorithm. We use simulated spectra to 
assess the instrumental parameters required to mitigate 
line confusion. 

The number of lines depends on the number of galax¬ 
ies within a spatially unresolved observation. Decreas- 

2 Working with noiseless simulations, we define a “peak” as any 
spectral bin brighter than two neighbors on either side. The effects 
of coarser instrumental resolution are discussed in § 3.2 
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ing the angular resolution increases both the number of 
galaxies observed and the spectral density of emission 
lines. Observing more spectral lines in turn increases 
the probability that lines from different source redsliifts 
will overlap in wavelength, requiring higher spectral res¬ 
olution to distinguish individual lines. 

We quantify the spectroscopic resolution as a function 
of the angular resolution. For a given angular resolution, 
we simulate the galaxy population and generate the su¬ 
perposed spectra (including rotational broadening) from 
all sources using the species in Figure [T] We identify all 
peaks in the resulting (noiseless) spectra and compare 
the separation AA between adjacent peaks to the wave¬ 
length A to determine the spectral resolution R = A/AA 
needed to distinguish the two as separate lines. 

Figure [I] shows the fraction of the observed spectral 
peaks whose separation AA from the neighboring peak 
is greater than than 2A /R, where the factor of 2 ac¬ 
counts for the need to observe two distinct peaks. At 
low spectral resolution, the lines are heavily blended so 
that only the few brightest lines can be distinguished. 
As the instrument spectral resolution increases, progres¬ 
sively more of the lines can be distinguished. The spec¬ 
tral resolution needed to distinguish separate peaks de¬ 
pends on the angular resolution: decreasing the angular 
resolution reduces the number of galaxies observed, pro¬ 
ducing correspondingly greater separation between adja¬ 
cent peaks in the observed spectra. Spectral resolution 
R = 700 (velocity resolution 430 km s -1 ) successfully 
resolves 90% of the peaks in the blended spectra for a 
20" resolution. A higher resolution R = 4000 (75 km 
s" 1 ) is required to resolve 90% of the peaks for 10' reso¬ 
lution. Rotational line widths of a few hundred km s^ 1 
effectively limit the resolution to values R < 5000 inde¬ 
pendent of the angular resolution. Increasing the spec¬ 
tral resolution above R = 5000 does not distinguish ad¬ 
ditional lines, as the observed spectra are then limited 



Fig. 4.— Fraction of adjacent peaks in the simulated spectra 
that can be resolved at spectral resolution R. Results are shown 
for angular resolution 0.3, 1, 3, and 10 arc-minutes (left to right). 
Observations at modest spectral resolution (R ~ 3000) resolve as 
much of the blended spectrum as possible. The superposition of 
rotationally broadened lines from multiple sources creates an effec¬ 
tive ceiling at R ~ 5000. Observations at higher spectral resolution 
do not distinguish additional lines. 



Fig. 5.— Flux from the red-shifted [Cn] line (solid line) as a 
function of observing wavelength. The mean flux of the brighter 
companion lines from each [Cn] source is shown as a dotted line, 
while the dashed line shows the flux of competing emission from 
unrelated interloper lines. 

by intrinsic source blending and not by the instrument. 

3.3. Sensitivity 

The line identification algorithm requires sufficient in¬ 
strumental sensitivity to detect both the [Cn] line as well 
as one or more companion lines. Figure [S] shows the typ¬ 
ical line flux as a function of observing wavelength. For 
each [Cil] source in the simulation, we compute the [Cn] 
line intensity as well as the mean intensity of the four 
companion lines ([Oi] 146 fin 1 , [Om] 88/zm, [Nil] 205/im, 
and [Nil] 122/rm) most often used by the algorithm for 
successful [Cli] identifications. To the extent that phys¬ 
ical conditions in the interstellar medium remain con¬ 
stant, the ratio of companion to [Cli] line intensity re¬ 
mains constant so that the observed line fluxes simply 
fall at the longer observing wavelengths in accordance 
with the higher source redshift. Interloper lines, how¬ 
ever, include sources at multiple redshifts unrelated to 
the [Cli] source redshift and thus fall more slowly with 
wavelength. 

The companion lines are roughly a factor of 10 fainter 
than the [Cli] line. Observations using the CLEAN al¬ 
gorithm to determine source redshift must retain suffi¬ 
cient sensitivity to observe these fainter companion lines 
in addition to the targeted [Cli] lines. Interloper lines 
become brighter than the companion lines for wave¬ 
lengths A > 350 fim corresponding to [Cli] source red¬ 
shift z s > 1. Observations at longer wavelengths tar¬ 
geting higher source redshifts require sufficient spectral 
resolution to distinguish the companion lines from the 
brighter interloper lines (Figure 0}. 

The requirement to distinguish at least one identi¬ 
fiable companion line for each [Cli] line implies that 
the sensitivity and integration times for a blind survey 
are determined by the fainter companion lines and not 
the targeted [Cil] lines. To reduce the total observing 
time needed to map a given field, surveys are likely to 
employ multiple spectrometers simultaneously observing 
independent lines of sight. Such an arrang ement has 
been envisioned fo r future surveys iSilva et, ahl (|2014f) : 
lUzsril et, al. I (|2014fl 
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4. PENCIL-BEAM SURVEY 

The ubiquity and intensity of line emission from ion¬ 
ized carbon makes the [Cll] line a promising probe 
of the high-redshift universe. We may thus antici¬ 
pate future surveys using the [Cll] line for redsliifts 
z < 10 to probe large scale struct ure, galax y evo lu¬ 
tion. and the epoch of r eionization (lSugmj3hara et_ai~] 
19991: iRighi et al. 1 120081: IVisbal Trac. and Loeb 120111 : 


Gong et al. 2012 ; Silva et, al. 112014 lllzgil et al. II2014H . 
Instrumental requirements for such a survey, however, 
are not clear. Observations from ground-based plat¬ 
forms (e.g. ALMA or PdBI) allow large collecting areas 
but are limited to the available atmospheric windows. 
Even within these windows, atmospheric emission is 6 
orders of magnitude brighter than the astrophysical fore¬ 
grounds. The resulting photon noise limits sensitivity, 
requiring integration times of 500 hours to detect a hand- 
ful of sources within a co smologically interesting volume 
(|da Cunha et al. ll2013aT) . 

Observations from a balloon or space platform can mit¬ 
igate atmospheric emission at the cost of limiting the 
size of the beam-forming optics. The smaller collecting 
area reduces sensitivity to individual unresolved sources, 
while the coarser angular resolution increases confusion 
from multip le so urc es w ithin each angular resolution ele¬ 
ment. lllzgil et al. 1 (120141 1 discuss intensity mapping from 
a balloon platform as one way to cope with these prob¬ 
lems. Here we consider the complementary approach of 
a pencil-beam survey from a small millimeter-wave tele¬ 
scope. A pencil-beam survey allows a single instrument 
to measure multiple galaxies within each line of sight, us¬ 
ing spectroscopy to separate individual sources and de¬ 
termine source redshifts. 

Figure [G] shows the number of [Cll] lines that could be 
identified within a single pointing for a noiseless instru¬ 
ment. We simulate a single 2' beam observed with spec¬ 
tral resolution R = 3000 over one octave in wavelength 
and use the CLEAN algorithm to identify lines within the 
resulting spectra. We examine 3 cases for spectral bands 
300-600 /im, 500-1000 pm, and 700-1400 pm correspond¬ 
ing to targeted [Cll] redshifts z = 2, 4, and 6. The upper 
panel shows the cumulative number count N(> S ) for 
all [Cu] lines brighter than threshold S that were iden¬ 
tified by the algorithm. The broad spectral band allows 
the number counts to include sources over a range of 
redshifts. The bottom panel shows the number counts 
for the subset of detected lines within 20% of the tar¬ 
get redshift. A single pointing at 5cr threshold of 0.3 
mJy can potentially identify 20-40 redshifted [Cll] lines 
(plus a comparable number of companion lines) within a 
2' beam. 

We illustrate the potential utility of a pencil-beam sur¬ 
vey using a toy model of the star formation history. We 
compare a model with no luminosity evolution to a sec¬ 
ond model for which the typical luminosity increases by a 
factor of 5 at redshift z p . Figure [7] shows the result. The 
recovered luminosity distributions accurately reflect the 
toy model luminosity evolution. Such a pencil-beam sur¬ 
vey easily distinguishes a model in which the luminosity 
peaks at z p ~ 2.5 from a model with no evolution. 


5. CONCLUSIONS 
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Fig. 6.— Number of identified [Cll] lines as a function of line 
intensity. Solid lines show the cumulative [Cll] number counts de¬ 
rived from simulations of a single 2' beam observed over one octave 
in wavelength centered at redshifts 2 (top curve, blue), 4 (green), or 
6 (bottom curve, red). Dashed lines show the number of false iden¬ 
tifications. Companion lines are not shown. (Top) Total number of 
[Cn] detections over the full wavelength band. (Bottom) Subset of 
detected lines with redshift lying within 20% of the target redshift. 


The [Cll] line is a principal coolant of the interstel¬ 
lar medium and is observable at cosmological distances. 
However, the [Cu] line is observed against a forest of 
competing lines from sources at multiple redshifts. We 
use Monte Carlo simulations of the galaxy distribution 
at redshifts 0 < z < 9 to generate realizations of the far- 
IR spectra including emission from 17 of the brightest 
far-IR lines. The resulting spectra provide a test bed to 
evaluate the use of the [Cll] line as a cosmological probe 
in blind pencil-beam surveys. 

The superposition of rotationally-broadened lines from 
multiple galaxies at different redshifts creates a complex 
spectrum. The simplest assumption, that the brightest 
lines are always [Cu] lines, is at best a coarse approxima¬ 
tion to the simulated spectra. The fraction of [Cll] lines 
among the brightest 10% of all observed lines falls from 
0.8 for observations near 315 /iin (z s = 1) to 0.4 at 1100 
/im (z s = 6). Line identification based solely on relative 
intensity produces a high error rate. 

A variant of the radio-astronomical CLEAN algorithm 
yields superior results. The algorithm selects the bright¬ 
est line from an observed set of spectral lines, tenta¬ 
tively identifies the line as [Cu], and uses the resulting 
source redshift to predict the observed wavelengths for 
redshifted line emission from other bright far-IR lines 
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Fig. 7.— Galaxy luminosity distribution derived from a simu¬ 
lated pencil-beam survey observing the [C n] line over one octave 
in wavelength at spectral resolution R = 3000. The recovered lu¬ 
minosity distribution (solid line) accurately follows the toy model 
(dotted curve). The dashed line shows the recovered luminosity 
distribution for a model without luminosity evolution. 

from the same source. If the predicted lines are observed, 
the source identification is confirmed and the [Cil] line 
and redshifted companion lines are flagged and removed 
from the list of unknown lines. If the predicted lines are 
not observed, the source identification is not confirmed 
and the bright source line remains on the list of unknown 
lines for possible later identification as a companion line 
for a source at a different redshift. The algorithm then 
proceeds to the next brightest remaining line and iterates 
until all lines have been processed. 

A typical spectrum for 2' angular resolution contains 
some 1600 spectral lines within one octave, originating 
from 550 different galaxies. The superposition of rota- 
tionally broadened lines reduces this to roughly 390 dis¬ 
tinct peaks in the observed spectrum, 110 of which result 
from a single un-blended [Cil] line. The line identifi¬ 
cation algorithm correctly identifies 40 of the 110 [Cil] 
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line peaks for an efficiency of order 36% (not counting 
companion lines identified from the same host galaxies). 
Chance alignments of lines lead the algorithm to incor¬ 
rectly identify 3 peaks as [Cil] lines; however, the incor¬ 
rect identifications result almost exclusively from fainter 
candidate peaks. Restricting the algorithm to the bright¬ 
est third of the observed peaks (corresponding roughly 
to threshold 0.3 mJy) increases the percentage of correct 
identifications above 90%. Correct line identification re¬ 
quires detection of both the targeted “parent” line and 
one or more “companion” lines from the same source. 
Survey sensitivity thresholds are thus determined by the 
expected intensities of the companion lines and not solely 
by the brighter [Cil] line. 

The ability to distinguish individual peaks in the su¬ 
perposed spectra depends on the spectral resolution and 
angular resolution. Decreasing the angular resolution in¬ 
creases the number of lines in the observed spectra, re¬ 
quiring higher spectral resolution to minimize blending of 
neighboring lines. The required resolutions are modest. 
Spectral resolution R = 700 resolves 90% of the peaks 
for 20" angular resolution, while R = 4000 resolves 90% 
of the peaks for 10' angular resolution or coarser. 

The [Cn] line is a promising cosmological probe. The 
large number of potential sources observed within a sin¬ 
gle pencil beam combined with an efficient and accurate 
line identification algorithm allow rapid characterization 
of the galaxy luminosity distribution to redshifts encom¬ 
passing reionization. A simulated pencil-beam survey 
demonstrates the ability to distinguish toy models of 
galaxy evolution within a single angular resolution ele¬ 
ment. The required angular resolution, sensitivity, wave¬ 
length range, and spectroscopic resolution appear achiev¬ 
able, suggesting that far-IR lines may soon provide a new 
tool to map the large-scale galaxy distribution and evo¬ 
lution for redshifts 0 < z s < 10. 
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